A unit for and method of image conversion



The invention relates to an image conversion unit for converting a first image 
sequence, comprising a first image with a first resolution and a second image with the first 
resolution into a second image sequence comprising a third image with a second resolution,
the image conversion unit comprising: 

- a coefficient-calculating means for calculating a first filter coefficient on 
basis of pixel values of the first image; 

- an adaptive filtering means for calculating a third pixel value of the third 
image on basis of a first one of the pixel values of the first image and the first filter 
coefficient. 

The invention further relates to a method of converting a first image sequence,
comprising a first image with a first resolution and a second image with the first resolution
into a second image sequence comprising a third image with a second resolution, the method
comprising: 

- calculating a first filter coefficient on basis of pixel values of the first image; 

and 

- calculating a third pixel value of the third image on basis of a first one of the
pixel values of the first image and the first filter coefficient. 

The invention further relates to an image processing apparatus comprising:

- receiving means for receiving a signal corresponding to a first image
sequence; and

- the above mentioned image conversion unit for converting the first image 
sequence into a second image sequence. 



The advent of HDTV emphasizes the need for spatial up-conversion 
techniques that enable standard definition (SD) video material to be viewed on high 
definition (HD) television (TV) displays. Conventional techniques are linear interpolation 
methods such as bi-linear interpolation and methods using poly-phase low-pass interpolation
filters. The former is not popular in television applications because of its low quality, but the
latter is available in commercially available ICs. With the linear methods, the number of
pixels in the frame is increased, but the high frequency part of the spectrum is not extended,
i.e. the perceived sharpness of the image is not increased. In other words, the capability of the
display is not fully exploited.

In addition to the conventional linear techniques, a number of non-linear
algorithms have been proposed to achieve this up-conversion. Sometimes these techniques
are referred to as content-based or edge dependent spatial up-conversion. Some of the 
techniques are already available on the consumer electronics market. 




An embodiment of the image conversion unit of the kind described in the
opening paragraph is known from the article "New Edge-Directed Interpolation", by Xin Li
et al., in IEEE Transactions on Image Processing, Vol. 10, No. 10, October 2001, pp. 1521-1527.
In this image conversion unit, the filter coefficients of an interpolation up-conversion
filter are adapted to the local image content. The interpolation up-conversion filter aperture
uses a fourth order interpolation algorithm as specified in Equation 1:

with $F_{HD}(i,j)$ the luminance values of the HD output pixels, $F_{SD}(i,j)$ the luminance values
of the input pixels and $w_e$ the filter coefficients. The filter coefficients are obtained from a
larger aperture using a Least Mean Squares (LMS) optimization procedure. The cited
article explains how the filter coefficients are calculated. The method according to the
prior art is also explained in connection with Fig. 1A and Fig. 1B. The method aims at
interpolating along edges rather than across them to prevent blurring. The authors make the
sensible assumption that edge orientation does not change with scaling. Therefore, the
coefficients can be approximated from the SD input image within a local window by using
the LMS method.

Although the "New Edge-Directed Interpolation" method according to the
cited prior art works relatively well in many image parts, there is a problem with selecting the
appropriate window for the LMS method. For windows of size n by m, there are (n-2)(m-2)
equations. Experimentally, the inventor found that a window of 4 by 4, which results in 4
equations, did not lead to a robust upscaling. Better results have been obtained using
windows of 8 by 8, i.e. with 36 equations. Although the up-conversion was more robust,
there was also more blurring. It is assumed that this is due to the fact that the image statistics
are not constant over this larger area, which causes the filter to converge towards a plain
averaging filter. To conclude: there is a conflict that complicates the choice of the window
size. On the one hand, because of the robustness the window size has to be large. On the
other hand, for constant image statistics the window size has to be as small as possible. Finally,
the LMS optimization requires at least the same number of equations as there are unknown
coefficients, which gives a lower bound to the window size.
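
The equation count quoted above can be checked with a few lines of Python. This is only an illustrative sketch: the function name `lms_equation_count` is hypothetical, and it assumes that a pixel contributes an equation only when all four of its diagonal neighbors lie inside the window.

```python
def lms_equation_count(n, m):
    """Number of LMS equations available in an n-by-m window.

    Assumption of this sketch: only pixels whose four diagonal neighbors
    all lie inside the window contribute, i.e. the (n-2) x (m-2) interior.
    """
    if n < 3 or m < 3:
        return 0  # no pixel has all four diagonal neighbors in the window
    return (n - 2) * (m - 2)


UNKNOWNS = 4  # four filter coefficients of the fourth order interpolation

for size in (3, 4, 8):
    count = lms_equation_count(size, size)
    # Print the window size, the equation count, and whether the count
    # reaches the lower bound set by the number of unknown coefficients.
    print(size, count, count >= UNKNOWNS)
```

A 4 by 4 window just meets the lower bound of four equations for four unknowns, which is consistent with the lack of robustness reported above.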



It is an object of the invention to provide an image conversion unit of the kind 
described in the opening paragraph which is relatively robust while the amount of image blur 
is relatively low. 

This object of the invention is achieved in that the coefficient-calculating
means is arranged to calculate the first filter coefficient on basis of further pixel values of the
second image. In other words, the aperture of the coefficient-calculating means is enlarged in
the temporal domain rather than in the spatial domain. The assumption then is that in
corresponding -smaller- image parts of different images, the statistics are more similar than in
different locations of a -larger- part in the same image. This is particularly to be expected in 
the case that the corresponding image parts are taken along the motion trajectory. So, 
additional to the assumption that edge orientation is independent of scale, it is now assumed 
that edge orientation is constant over time when corrected for motion. Pixel values are 
luminance values or color values. 

Notice that the further pixel values are not applied in the direct path of
processing the input pixels of the first image into output pixels, i.e. the pixels of the third
image, but in the control path to determine the filter coefficients. Combining input pixel
values of multiple input fields into a single output pixel value of a single output image, i.e.
frame, is for instance known as de-interlacing. Interlacing is the common video broadcast
procedure for transmitting the odd and even numbered image lines alternately. De-interlacing
attempts to restore the full vertical resolution, i.e. make odd and even lines available
simultaneously for each image. The purpose of de-interlacing is the reduction of alias in
successive fields. However, a purpose of the image conversion unit according to the present
invention is to increase the resolution of input images on basis of respective input images.
This is done by means of a spatial filter which is adapted to edges in order to limit the
amount of blur which would arise without the adaptation to the edges. The spatial filter is
controlled by means of filter coefficients which are determined on basis of multiple input
images.

An embodiment of the image conversion unit according to the invention is 
arranged to acquire the pixel values of the first image from a first part of the first image and 
the further pixel values of the second image from a second part of the second image, with the 
first part and the second part spatially corresponding. An advantage of this embodiment is 
that it is relatively simple. Acquisition of the appropriate pixels from the second image is
straightforward without additional calculations. Temporary storage of a number of pixel
values of the second image is required.

An embodiment of the image conversion unit according to the invention is 
arranged to acquire the pixel values of the first image from a first part of the first image and 
the further pixel values of the second image from a second part of the second image, with the
first part and the second part on a motion trajectory. Motion vectors have to be provided by
means of a motion estimator. These motion vectors describe the relation between the first
part and the second part. An advantage of this embodiment is that the images of the second
sequence, i.e. the output images, are relatively sharp.

In an embodiment of the image conversion unit according to the invention the
coefficient-calculating means is arranged to calculate the first filter coefficient by means of
an optimization algorithm. Preferably the optimization algorithm is a Least Mean Square
algorithm. An LMS algorithm is relatively simple and robust.

It is a further object of the invention to provide a method of the kind described
in the opening paragraph which is relatively robust while the amount of image blur is
relatively low.

This object of the invention is achieved in that the first filter coefficient is
calculated on basis of further pixel values of the second image.

It is a further object of the invention to provide an image processing apparatus
of the kind described in the opening paragraph, of which the image conversion unit is relatively robust
while the amount of image blur is relatively low. 

This object of the invention is achieved in that the coefficient-calculating
means of the image processing apparatus is arranged to calculate the first filter coefficient on
basis of further pixel values of the second image. The image processing apparatus optionally
comprises a display device for displaying the second image. The image processing apparatus
might e.g. be a TV, a set top box, a VCR (Video Cassette Recorder) player or a DVD (Digital
Versatile Disk) player.



WO 2004/028158 PCT/IB2003/003563


Modifications of the image conversion unit and variations thereof may correspond to
modifications and variations thereof of the method and of the image processing apparatus 
described. 



These and other aspects of the image conversion unit, of the method and of the
image processing apparatus according to the invention will become apparent from and will be
elucidated with respect to the implementations and embodiments described hereinafter and
with reference to the accompanying drawings, wherein:

Fig. 1A schematically shows an embodiment of the image conversion unit
according to the prior art;

Fig. 1B schematically shows a number of pixels to explain the method
according to the prior art;

Fig. 2A schematically shows two images to explain an embodiment of the
method according to the invention;

Fig. 2B schematically shows two images to explain an alternative embodiment
of the method according to the invention;

Fig. 2C schematically shows an embodiment of the image conversion unit
according to the invention;

Fig. 3A schematically shows an SD input image;

Fig. 3B schematically shows the SD input image of Fig. 3A on which pixels
are added in order to increase the resolution;

Fig. 3C schematically shows the image of Fig. 3B after being rotated over 45
degrees;

Fig. 3D schematically shows an HD output image derived from the SD input
image of Fig. 3A; and

Fig. 4 schematically shows an embodiment of the image processing apparatus
according to the invention.

Same reference numerals are used to denote similar parts throughout the figures. 




Fig. 1A schematically shows an embodiment of the image conversion unit 100
according to the prior art. The image conversion unit 100 is provided with standard definition
(SD) images at the input connector 108 and provides high definition (HD) images at the
output connector 110. The image conversion unit 100 comprises:

- A pixel acquisition unit 102 which is arranged to acquire a first set of pixel 
values of pixels 1-4 (see Fig. 1B) in a first neighborhood of a particular location within a first
one of the SD input images which corresponds with the location of an HD output pixel and is 
arranged to acquire a second set of pixel values of pixels 1-16 in a second neighborhood of 
the particular location within the first one of the SD input images; 

- A filter coefficient-calculating unit 106, which is arranged to calculate filter 
coefficients on basis of the first set of pixel values and the second set of pixel values. In other 
words, the filter coefficients are approximated from the SD input image within a local
window. This is done by using a Least Mean Squares (LMS) method which is explained in
connection with Fig. 1B.

- An adaptive filtering unit 104 for calculating the pixel value of the HD 
output pixel on basis of the first set of pixel values and the filter coefficients as specified in
Equation 1. Hence the filter coefficient-calculating unit 106 is arranged to control the 
adaptive filtering unit 104. 

Fig. 1B schematically shows a number of pixels 1-16 of an SD input image
and one HD pixel of an HD output image, to explain the method according to the prior art.
The HD output pixel is interpolated as a weighted average of 4 pixel values of pixels 1-4.
That means that the luminance value of the HD output pixel $F_{HD}$ results as a weighted sum
of the luminance values of its 4 SD neighboring pixels:

$$F_{HD} = w_1 F_{SD}(1) + w_2 F_{SD}(2) + w_3 F_{SD}(3) + w_4 F_{SD}(4) \qquad (2)$$

where $F_{SD}(1)$ to $F_{SD}(4)$ are the pixel values of the 4 SD input pixels 1-4 and $w_1$ to $w_4$ are the
filter coefficients to be calculated by means of the LMS method. The authors of the cited
article in which the prior art method is described make the sensible assumption that edge
orientation does not change with scaling. The consequence of this assumption is that the
optimal filter coefficients are the same as those to interpolate, on the standard resolution grid:

- Pixel 1 from 5, 7, 11, and 4 (that means that pixel 1 can be derived from its 4
neighbors)

- Pixel 2 from 6, 8, 3, and 12

- Pixel 3 from 9, 2, 13, and 15

- Pixel 4 from 1, 10, 14, and 16

This gives a set of 4 linear equations from which, with the LMS optimization, the optimal 4
filter coefficients to interpolate the HD output pixel are found.
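
The set of 4 linear equations can be written down directly from the neighbor list above. The following Python sketch is only an illustration: the names `NEIGHBORS` and `build_system` are hypothetical, the pixel values are synthetic, and it assumes that the order in which the neighbors are listed matches the order of the coefficients w1 to w4.

```python
# Neighbor table taken from the list above:
# target pixel -> the four SD pixels it is interpolated from.
NEIGHBORS = {
    1: (5, 7, 11, 4),
    2: (6, 8, 3, 12),
    3: (9, 2, 13, 15),
    4: (1, 10, 14, 16),
}


def build_system(pixels):
    """Return (C, y) for the 4-by-4 window.

    `pixels` maps the pixel numbers 1..16 of Fig. 1B to pixel values.
    Row k of C holds the neighbor values of target pixel k+1 and y[k]
    holds the value of that target pixel, so that C w should approximate y.
    Assumption: the neighbor order equals the coefficient order w1..w4.
    """
    targets = sorted(NEIGHBORS)
    C = [[pixels[k] for k in NEIGHBORS[t]] for t in targets]
    y = [pixels[t] for t in targets]
    return C, y


# Synthetic luminance values, chosen only to make the rows easy to read.
pixels = {k: 10 * k for k in range(1, 17)}
C, y = build_system(pixels)
for row, target in zip(C, y):
    print(row, "->", target)
```

With only four equations for four unknowns the system is exactly determined, so noise in a single pixel value directly perturbs the coefficients.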

Denoting M as the pixel set, on the SD grid, used to calculate the 4 weights,
the Mean Square Error (MSE) over set M in the optimization can be written as the sum of
squared differences between original SD pixels $F_{SD}$ and interpolated SD pixels $F_{SI}$:

$$MSE = \sum_{F_{SD}(i,j) \in M} \left( F_{SD}(2i+2, 2j+2) - F_{SI}(2i+2, 2j+2) \right)^2 \qquad (3)$$

Which in matrix formulation becomes:

$$MSE = \| y - C w \|^2 \qquad (4)$$

Here $y$ contains the SD pixels in M (pixels $F_{SD}(1,1)$ to $F_{SD}(1,4)$, $F_{SD}(2,1)$ to $F_{SD}(2,4)$,
$F_{SD}(3,1)$ to $F_{SD}(3,4)$, $F_{SD}(4,1)$ to $F_{SD}(4,4)$) and $C$ is an $M^2 \times 4$ matrix whose k-th row
holds the four diagonal SD neighbors of the k-th SD pixel in $y$.
The weighted sum of each row describes a pixel $F_{SI}$, as used in Equation 3.
To find the minimum MSE, i.e. LMS, the derivative of the MSE with respect to $w$ is calculated
and set to zero:

$$\frac{\partial (MSE)}{\partial w} = 0 \qquad (5)$$

$$-2 C^T y + 2 C^T C w = 0 \qquad (6)$$

$$w = (C^T C)^{-1} (C^T y) \qquad (7)$$

By solving Equation 7 the filter coefficients are found and by using Equation 2 the pixel
values of the HD output pixels can be calculated.
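
The solution step of Equation 7 followed by the interpolation of Equation 2 can be sketched in plain Python. This is a minimal sketch under stated assumptions, not the implementation of the cited article: the function names are hypothetical, the equations in `C` and `y` are synthetic, and exact rational arithmetic is used only to keep the example free of rounding effects.

```python
from fractions import Fraction


def solve_normal_equations(C, y):
    """Solve (C^T C) w = C^T y exactly for the coefficient vector w."""
    n = len(C[0])
    # Augmented matrix [C^T C | C^T y], built with exact rational arithmetic.
    A = [[sum(Fraction(row[i]) * row[j] for row in C) for j in range(n)]
         + [sum(Fraction(row[i]) * yk for row, yk in zip(C, y))]
         for i in range(n)]
    # Gauss-Jordan elimination with a search for a non-zero pivot.
    for col in range(n):
        pivot = next((r for r in range(col, n) if A[r][col] != 0), None)
        if pivot is None:
            raise ValueError("C^T C is singular: too few independent equations")
        A[col], A[pivot] = A[pivot], A[col]
        A[col] = [v / A[col][col] for v in A[col]]
        for r in range(n):
            if r != col and A[r][col] != 0:
                factor = A[r][col]
                A[r] = [vr - factor * vc for vr, vc in zip(A[r], A[col])]
    return [A[i][n] for i in range(n)]


def interpolate_hd(w, sd_neighbors):
    """Equation 2: weighted sum of the four SD neighbors of the HD pixel."""
    return sum(wk * fk for wk, fk in zip(w, sd_neighbors))


# Synthetic equations (an assumption of this sketch): each row of C holds
# four diagonal neighbor values, y holds the pixel values to be reproduced.
C = [[100, 50, 50, 20], [50, 100, 20, 50], [50, 20, 100, 50],
     [20, 50, 50, 100], [80, 80, 40, 40], [40, 40, 80, 80]]
y = [60, 50, 50, 60, 60, 60]

w = solve_normal_equations(C, y)
print([str(wk) for wk in w])
print(interpolate_hd(w, [100, 50, 50, 20]))
```

The `ValueError` branch corresponds to the lower bound on the window size: when the window supplies fewer independent equations than unknown coefficients, $C^T C$ cannot be inverted.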

In this example a window of 4 by 4 pixels is used for the calculation of the
filter coefficients. An LMS optimization on a larger window, e.g. 8 by 8 instead of 4 by 4,
gives better results.

Fig. 2A schematically shows two SD input images 202, 204 to explain an 
embodiment of the method according to the invention. Each of the two SD input images 202, 
204 comprises a number of pixels, e.g. 210-220, which are indicated with X-signs. Suppose
that an HD output pixel has to be calculated. The location corresponding to this HD output
pixel is indicated in a first one of the input images 202. For the calculation of a first filter
coefficient, e.g. with which SD input pixel 212 has to be multiplied, a set of equations has to
be solved as explained in connection with Fig. 1B. The known components of these equations
correspond with pixel values, e.g. 210-215, taken from a first part 206 of the first one of the
input images 202, but also with pixel values, e.g. 216-220, taken from a second part 208 of a
second one of the input images 204. Preferably the pixel values which are used to determine
the first filter coefficient of a particular pixel 212 are acquired from the local neighborhood.
That means that the pixels which are connected to the particular pixel 212 are applied, e.g.
the upper, the lower, the right, the left and the diagonal pixels. The pixel values of the second
image are also acquired from a local neighborhood which corresponds to the local
neighborhood in the first image. The first part 206 of the first one of the input images 202
and the second part 208 of the second one of the input images 204 are spatially
corresponding. That means that all respective pixels of the first part 206 have the same
coordinates as the corresponding pixels of the second part 208. That is not the case with the
image parts 206 and 222 as depicted in Fig. 2B.
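
How the equations of two spatially corresponding parts might be combined is sketched below in Python. The function names and the synthetic images are assumptions made for illustration; the sketch also assumes, as in the prior art description, that each interior pixel of a part is modelled by its four diagonal neighbors.

```python
def window_equations(image, top, left, n, m):
    """Return (C, y) for the n-by-m part whose upper-left pixel is (top, left).

    Each interior pixel contributes one equation: its value (an entry of y)
    is modelled as a weighted sum of its four diagonal neighbors (a row of C).
    """
    C, y = [], []
    for r in range(top + 1, top + n - 1):
        for c in range(left + 1, left + m - 1):
            C.append([image[r - 1][c - 1], image[r - 1][c + 1],
                      image[r + 1][c - 1], image[r + 1][c + 1]])
            y.append(image[r][c])
    return C, y


def temporal_equations(first, second, top, left, n, m):
    """Stack the equations of spatially corresponding parts of two images."""
    C1, y1 = window_equations(first, top, left, n, m)
    C2, y2 = window_equations(second, top, left, n, m)
    return C1 + C2, y1 + y2


# Two hypothetical 4x4 SD images (synthetic values, not from the source).
first = [[r * 4 + c for c in range(4)] for r in range(4)]
second = [[v + 1 for v in row] for row in first]

C, y = temporal_equations(first, second, 0, 0, 4, 4)
print(len(C), "equations for", len(C[0]), "unknown coefficients")
```

In this sketch the number of equations doubles while the spatial extent of the window stays at 4 by 4, which is the trade-off the temporal aperture is meant to provide.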

Fig. 2B schematically shows two images 202, 204 to explain an alternative 
embodiment of the method according to the invention. The first part 206 of the first one of 
the input images 202 and the third part 222 of the second one of the input images 204 are
located on a motion trajectory. The relation between the first part 206 and the third part 222 is
determined by the motion vector 230 which has been calculated by means of a motion
estimator. This motion estimator might be the motion estimator as described in the article
"True-Motion Estimation with 3-D Recursive Search Block Matching" by G. de Haan et al.
in IEEE Transactions on Circuits and Systems for Video Technology, vol. 3, no. 5, October
1993, pages 368-379. In this case the respective pixels of the two image parts correspond to
substantially equal picture content although there was movement of objects in the scene 
being imaged. 
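
The difference between a spatially corresponding part and a part on the motion trajectory can be illustrated with a short Python sketch. The function `fetch_part`, the clamping at the image borders and the synthetic scene are assumptions of this example; the motion vector is simply given here, whereas in the embodiment it would be supplied by a motion estimator.

```python
def fetch_part(image, top, left, n, m, motion=(0, 0)):
    """Return the n-by-m part of `image` displaced by `motion` = (dy, dx).

    Coordinates are clamped to the image borders; that border policy is an
    assumption of this sketch.
    """
    dy, dx = motion
    height, width = len(image), len(image[0])
    part = []
    for r in range(top + dy, top + dy + n):
        rr = min(max(r, 0), height - 1)
        part.append([image[rr][min(max(c, 0), width - 1)]
                     for c in range(left + dx, left + dx + m)])
    return part


# Synthetic scene: the second image is the first one shifted right by 2 pixels.
first = [[10 * r + c for c in range(8)] for r in range(6)]
second = [[row[max(c - 2, 0)] for c in range(8)] for row in first]

part_206 = fetch_part(first, 1, 1, 3, 3)                   # part in the first image
part_208 = fetch_part(second, 1, 1, 3, 3)                  # same coordinates
part_222 = fetch_part(second, 1, 1, 3, 3, motion=(0, 2))   # along the motion vector
print(part_206 == part_222, part_206 == part_208)
```

Only the part fetched along the motion vector shows the same picture content as the part in the first image, which is why its equations are expected to share the same image statistics.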

Fig. 2C schematically shows an embodiment of the image conversion unit 200
according to the invention. The image conversion unit 200 is provided with standard
definition (SD) images at the input connector 108 and provides high definition (HD) images
at the output connector 110. The SD input images have pixel matrices as specified in CCIR-601,
e.g. 625*720 pixels or 525*720 pixels. The HD output images have pixel matrices with
e.g. twice or one-and-a-half times the number of pixels in horizontal and vertical direction.
The image conversion unit 200 comprises:

- A memory device for storage of a number of pixels of a number of SD input
images.

- A pixel acquisition unit 102 which is arranged to acquire:




- a first set of pixel values of pixels from a first one of the SD input images in
a first neighborhood of a particular location within the first SD input image, which
corresponds with the location of the HD output pixel;

- a second set of pixel values of pixels from the first SD input image in a
second neighborhood of the particular location;

- a third set of pixel values of pixels from a second one of the SD input images
in a third neighborhood of the particular location;

- an optional fourth set of pixel values of pixels from a third one of the SD
input images in a fourth neighborhood of the particular location.

- A filter coefficient-calculating unit 106 which is arranged to calculate filter
coefficients on basis of the first, second, third and optionally fourth set of pixel values. In
other words, the filter coefficients are approximated from the SD input images within a local
window located in the first SD input image and the window extending to the second SD input
image and optionally to the third SD input image. Preferably the second SD input image and
the third SD input image are respectively preceding and succeeding the first SD input image
in the sequence of SD input images. The approximation of the filter coefficients is done by
using a Least Mean Squares (LMS) method which is explained in connection with Fig. 1B,
Fig. 2A and Fig. 2B; and

- An adaptive filtering unit 104 for calculating a pixel value of an HD output
image on basis of the second set of pixel values. The HD output pixel is calculated as the
weighted sum of the pixel values of the first set of pixel values.

The image conversion unit 200 optionally comprises an input connector 114
for providing motion vectors to be applied by the pixel acquisition unit 102 for the
acquisition of pixel values in the succeeding SD input images of the SD input image
sequence, which are on respective motion trajectories, as explained in connection with Fig. 2B.

The number of pixels acquired in the neighborhood, i.e. the window size,
might be even or odd, e.g. 4*4 or 5*5 respectively. Besides that, the shape of the window
does not have to be rectangular. Also the number of pixels acquired from the first image and
the number of pixels acquired from the second image do not have to be mutually equal.

The pixel acquisition unit 102, the filter coefficient-calculating unit 106 and 
the adaptive filtering unit 104 may be implemented using one processor. Normally, these 
functions are performed under control of a software program product. During execution,
normally the software program product is loaded into a memory, like a RAM, and executed
from there. The program may be loaded from a background memory, like a ROM, hard disk,
or magnetic and/or optical storage, or may be loaded via a network like the Internet.
Optionally an application specific integrated circuit provides the disclosed functionality.

To convert an SD input image into an HD output image a number of 
processing steps are needed. By means of Figs. 3A-3D these processing steps are explained. 
Fig. 3A schematically shows an SD input image; Fig. 3D schematically shows an HD output
image derived from the SD input image of Fig. 3A and Figs. 3B and 3C schematically show
intermediate results.

- Fig. 3A schematically shows an SD input image. Each X-sign corresponds
with a respective pixel.

- Fig. 3B schematically shows the SD input image of Fig. 3A on which pixels 
are added in order to increase the resolution. The added pixels are indicated with +-signs. 
These added pixels are calculated by means of interpolation of the respective diagonal 
neighbors. The filter coefficients for the interpolation are determined as described in
connection with Fig. 2B.

- Fig. 3C schematically shows the image of Fig. 3B after being rotated over 45 
degrees. The same image conversion unit 200 as applied to calculate the image as
depicted in Fig. 3B on basis of Fig. 3A can be used to calculate the image as shown in Fig.
3D on basis of the image as depicted in Fig. 3B. That means that new pixel values are
calculated by means of interpolation of the respective diagonal neighbors. Notice that a first
portion of these diagonal neighbors (indicated with X-signs) corresponds to the original pixel
values of the SD input image and that a second portion of these diagonal neighbors (indicated
with +-signs) corresponds to pixel values which have been derived from the original pixel
values of the SD input image by means of interpolation.

- Fig. 3D schematically shows the final HD output image. The pixels that have 
been added in the last conversion step are indicated with o-signs.
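
The two conversion steps of Figs. 3A-3D can be sketched as follows in Python. This is a structural illustration only: fixed weights of 1/4 stand in for the adaptive filter coefficients, the function name `upscale_two_pass` is hypothetical, the output is limited to (2h-1) by (2w-1) pixels to avoid extrapolation, and border pixels in the second step are averaged over the neighbors that exist. All of these are assumptions of the sketch.

```python
def upscale_two_pass(sd, weights=(0.25, 0.25, 0.25, 0.25)):
    """Double the sampling density of `sd` in two interpolation steps."""
    h, w = len(sd), len(sd[0])
    H, W = 2 * h - 1, 2 * w - 1
    hd = [[None] * W for _ in range(H)]

    # Fig. 3A: copy the original SD pixels (X-signs) onto the HD grid.
    for r in range(h):
        for c in range(w):
            hd[2 * r][2 * c] = sd[r][c]

    # Fig. 3B: first step, new pixels (+-signs) from their diagonal neighbors.
    for r in range(1, H, 2):
        for c in range(1, W, 2):
            nb = (hd[r - 1][c - 1], hd[r - 1][c + 1],
                  hd[r + 1][c - 1], hd[r + 1][c + 1])
            hd[r][c] = sum(wk * v for wk, v in zip(weights, nb))

    # Figs. 3C/3D: second step, remaining pixels (o-signs). Their horizontal
    # and vertical neighbors are the diagonal neighbors on the grid rotated
    # over 45 degrees. Plain averaging is used here for simplicity.
    for r in range(H):
        for c in range(W):
            if (r + c) % 2 == 1:
                nb = [hd[rr][cc]
                      for rr, cc in ((r - 1, c), (r, c - 1), (r, c + 1), (r + 1, c))
                      if 0 <= rr < H and 0 <= cc < W]
                hd[r][c] = sum(nb) / len(nb)
    return hd


sd = [[0, 10], [20, 30]]
for row in upscale_two_pass(sd):
    print(row)
```

In the embodiment the weights would differ per output pixel and be supplied by the filter coefficient-calculating unit; the fixed weights above correspond to the plain averaging filter that the adaptive method is designed to improve on.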

Fig. 4 schematically shows an embodiment of the image processing apparatus 
400 according to the invention, comprising:

- Receiving means 402 for receiving a signal representing SD images. The
signal may be a broadcast signal received via an antenna or cable but may also be a signal
from a storage device like a VCR (Video Cassette Recorder) or Digital Versatile Disk
(DVD). The signal is provided at the input connector 408;

- The image conversion unit 404 as described in connection with Fig. 2B; and

- A display device 406 for displaying the HD output images of the image
conversion unit 200. This display device 406 is optional.

The image processing apparatus 400 might e.g. be a TV. Alternatively the 
image processing apparatus 400 does not comprise the optional display device but provides 
HD images to an apparatus that does comprise a display device 406. Then the image
processing apparatus 400 might be e.g. a set top box, a satellite-tuner, a VCR player or a
DVD player. But it might also be a system being applied by a film-studio or broadcaster.

It should be noted that the above-mentioned embodiments illustrate rather than 
limit the invention and that those skilled in the art will be able to design alternative 
embodiments without departing from the scope of the appended claims. In the claims, any
reference signs placed between parentheses shall not be construed as limiting the claim.
The word 'comprising' does not exclude the presence of elements or steps not listed in a
claim. The word "a" or "an" preceding an element does not exclude the presence of a
plurality of such elements. The invention can be implemented by means of hardware
comprising several distinct elements and by means of a suitably programmed computer. In
the unit claims enumerating several means, several of these means can be embodied by one 
and the same item of hardware. 



